added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to… #28

psteinb · 2017-03-13T13:46:58Z

… specify debug flags or gencode flags,

in more detail, I ran into trouble when trying to benchmark more than arraysize = 32*1024*1024. This is due to the fact that by default, not -gencode flags or similar are setup for the nvcc call of the cuda GPU stream. This defaults to an SM that doesn't support an arraysize of this magnitude and hence the init_arrays fails with error code 0xb Invalid arguments. We had the discussion about gencodes before in #8, but the new Makefile based build system doesn't reflect the outcome of #8.

… specify debug flags or gencode flags

tomdeakin · 2017-03-13T13:50:06Z

You don't need to predefine EXTRA_FLAGS in the Makefile for this to work. You can just specify the gencode at the command line with no changes:

make -f CUDA.make EXTRA_FLAGS="-g -gencode=<...>"

psteinb · 2017-03-13T13:57:35Z

might be, but that wouldn't have switched off `-O3` that was hard coded in the Makefile stub.

tomdeakin · 2017-03-13T14:19:26Z

That's true. Are you able to put the -O3 flag in CXXFLAGS instead of EXTRA_FLAGS to be consistent with the other Makefiles, e.g OpenMP.make?

psteinb · 2017-03-13T14:26:02Z

While I am at it, I'll align the cuda make to OpenMP.cmake although I'd recommend using `?=` where possible. OpenMP.cmake does always set variables directly where `?=` might offer more flexiblity. I'll finish the CUDA.make and thus leave the door open for CUDA-with-clang.

tomdeakin · 2017-03-13T15:05:43Z

What flexibility are you seeing with ?=? If you override any variables on the CLI then Make still uses this user defined one and ignores all the assignments with = in the Makefile.

psteinb · 2017-03-13T15:31:03Z

With this: ``` CXXFLAGS?=-O3 -std=c++11 GPUCXX?=nvcc cuda-stream: main.cpp CUDAStream.cu $(GPUCXX) $(CXXFLAGS) -DCUDA $^ $(EXTRA_FLAGS) -o $@ ``` I can pitch in anything on the command line and the Makefile stays quite concise: ``` $ CXXFLAGS="-g -G -std=c++11" make -f CUDA.cmake #produces debug output in case I screwed up $ make -f CUDA.cmake #produces the default -O3 binary ``` Just saying -

jrprice · 2017-03-13T15:39:19Z

Sure, but the same is true without the ?. e.g. if the Makefile looks like this:

CXXFLAGS=-std=c++11 -O3

cuda-stream: main.cpp CUDAStream.cu
        nvcc ${CXXFLAGS} -DCUDA $^ $(EXTRA_FLAGS) -o $@

You can still override CXXFLAGS like this:

make -f CUDA.cmake CXXFLAGS="-g -G -std=c++11"

psteinb · 2017-03-13T15:58:47Z

NB. I still need to dig the GNU make manual. Ha, learned something again! I checked your example with a handful of GNU make builds: **with `?=`** ``` $ cat testme.make CXXFLAGS?=-std=c++11 -O3 cuda-stream: main.cpp CUDAStream.cu nvcc ${CXXFLAGS} -DCUDA $^ $(EXTRA_FLAGS) -o $@ ``` using `?=` is agnostic of the position of the variable definition: ``` $ CXXFLAGS=-bla make -f testme.make -n nvcc -bla -DCUDA main.cpp CUDAStream.cu -o cuda-stream $ make -f testme.make -n CXXFLAGS=-bla nvcc -bla -DCUDA main.cpp CUDAStream.cu -o cuda-stream ``` **without `?=`** ``` $ cat testme.make CXXFLAGS=-std=c++11 -O3 cuda-stream: main.cpp CUDAStream.cu nvcc ${CXXFLAGS} -DCUDA $^ $(EXTRA_FLAGS) -o $@ ``` Now, watch this: ~~~ $ make -f testme.make -n CXXFLAGS=-bla nvcc -bla -DCUDA main.cpp CUDAStream.cu -o cuda-stream $ CXXFLAGS=-bla make -f testme.make -n nvcc -std=c++11 -O3 -DCUDA main.cpp CUDAStream.cu -o cuda-stream ~~~ See the last call? the updated value of CXXFLAGS is not used.

psteinb · 2017-03-17T13:32:09Z

we didn't come to a conclusion on this, did we?

tomdeakin · 2017-03-17T13:45:47Z

Thanks for your investigation into = vs ?= - we'd not realised the implication of this before. I think we will want to move all the Makefiles to using ?= instead of = so that it doesn't matter what position in the command line the variables are.

But for the purposes of this pull request, can you pull out the -O3 flag to CXXFLAGS but use an = for now. We can go though and update all the Makefiles consistently to ?= later.

make variable to cope with clang as CUDA compiler as well

psteinb · 2017-03-17T14:19:21Z

done

tomdeakin · 2017-03-17T14:22:52Z

Merged, thanks for the contribution.

tomdeakin · 2017-03-17T14:25:25Z

Created Issue #30 for switching to ?=

added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to…

ea12f2a

… specify debug flags or gencode flags

put -O3 into CXXFLAGS to comply with OpenMP.make

8c7a801

pulled -O3 out into CXXFLAGS, refactored CUDA compiler into CUDA_CXX

d8cb749

make variable to cope with clang as CUDA compiler as well

tomdeakin merged commit bf57cf5 into UoB-HPC:master Mar 17, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to… #28

added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to… #28

psteinb commented Mar 13, 2017

tomdeakin commented Mar 13, 2017 •

edited

Loading

psteinb commented Mar 13, 2017 via email

tomdeakin commented Mar 13, 2017

psteinb commented Mar 13, 2017 via email

tomdeakin commented Mar 13, 2017

psteinb commented Mar 13, 2017 via email

jrprice commented Mar 13, 2017

psteinb commented Mar 13, 2017 via email •

edited

Loading

psteinb commented Mar 17, 2017

tomdeakin commented Mar 17, 2017

psteinb commented Mar 17, 2017

tomdeakin commented Mar 17, 2017

tomdeakin commented Mar 17, 2017

added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to… #28

added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to… #28

Conversation

psteinb commented Mar 13, 2017

tomdeakin commented Mar 13, 2017 • edited Loading

psteinb commented Mar 13, 2017 via email

tomdeakin commented Mar 13, 2017

psteinb commented Mar 13, 2017 via email

tomdeakin commented Mar 13, 2017

psteinb commented Mar 13, 2017 via email

jrprice commented Mar 13, 2017

psteinb commented Mar 13, 2017 via email • edited Loading

psteinb commented Mar 17, 2017

tomdeakin commented Mar 17, 2017

psteinb commented Mar 17, 2017

tomdeakin commented Mar 17, 2017

tomdeakin commented Mar 17, 2017

tomdeakin commented Mar 13, 2017 •

edited

Loading

psteinb commented Mar 13, 2017 via email •

edited

Loading